Jianye Hao

Unveiling the Entropy Dynamics of Chain-of-Thought Reasoning

Jun 01, 2026
Via arXiv

Opt-Verifier: Unleashing the Power of LLMs for Optimization Modeling via Dual-Side Verification

May 28, 2026
Via arXiv

The Rank and Gradient Lost in Non-stationarity: Sample Weight Decay for Mitigating Plasticity Loss in Reinforcement Learning

Apr 02, 2026
Via arXiv

K^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control

Feb 28, 2026
Via arXiv

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Feb 22, 2026
Via arXiv

ActionCodec: What Makes for Good Action Tokenizers

Feb 17, 2026
Via arXiv

Short Chains, Deep Thoughts: Balancing Reasoning Efficiency and Intra-Segment Capability via Split-Merge Optimization

Feb 03, 2026
Via arXiv

Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis

Jan 29, 2026
Via arXiv

Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning

Jan 06, 2026
Via arXiv

Enhancing the Medical Context-Awareness Ability of LLMs via Multifaceted Self-Refinement Learning

Nov 14, 2025
Via arXiv